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Abstract. We give sufficient conditions for a shift space (E, a) to be 
intrinsically ergodic, along with sufficient conditions for every subshift 
factor of E to be intrinsically ergodic. As an application, we show that 
every subshift factor of a /3-shift is intrinsically ergodic, which answers 
an open question included in Mike Boyle's article "Open problems in 
symbolic dynamics". We obtain the same result for S-gap shifts, and 
describe an application of our conditions to more general coded systems. 
One novelty of our approach is the introduction of a new version of the 
specification property that is well adapted to the study of symbolic 
spaces with a non-uniform structure. 



1. Introduction 

We study uniqueness of a measure of maximal entropy (or mme for short) 
in the context of symbolic dynamics. Dynamical systems with a unique mme 
are called intrinsically ergodic. Determining which dynamical systems are 
intrinsically ergodic is a central problem at the interface of ergodic theory 
and top ological dynamics [Hof79l IH0T8TI IGur72l IWeTTOl IBuz97l IBG071 IBF091 
IBow74] . and is a prototypical result for the thermodynamic formalism, a 
powerful tool for studying the statistical properties of a system. 

Parry |Par64| and Weiss |Wei70t IWei73| established intrinsic ergodicity 
for topologically transitive shifts of finite type (SFTs), and all their subshift 
factors (sofic shifts). Given a property defining a class of shifts, it is natural 
to ask whether this property implies intrinsic ergodicity, and whether it 
is preserved by passing to factors. This is known to be the case for the 
specification property [Bow 74] . but beyond specification, less is known. 

Our motivating examples are the family of /3-shifts and the family of S- 
gap shifts, which do not have specification (see $3|). In particular, we answer 
the following open problem posed by Klaus Thomsen, which is Problem 28.2 
of Mike Boyle's article "Open problems in symbolic dynamics" |Boy08|: 

Problem. Must a subshift factor of a /3-shift be intrinsically ergodic? 

Theorem A. Every subshift factor of a (3 -shift or S-gap shift is intrinsically 
ergodic. Moreover, the unique measure of maximal entropy can be charac- 
terised as the (well defined) weak* limit as n — > oo of 5-measures evenly 
distributed across periodic points of period at most n. 
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Theorem 12] is proved via our more general main result (Theorem ICl) . 
which provides natural sufficient conditions for an abstract shift space to be 
intrinsically ergodic. These conditions may be viewed as a weakening of the 
specification property, and are well behaved under the operation of taking 
factors. 

Our conditions take a particularly simple form for the class of coded 
systems, which includes /3-shifts and S-gap shifts. A shift space IcS p = 
is coded if there exists a countable collection of finite words, 
called generators, such that X is the closure of the set of sequences obtained 
by freely concatenating the generators. Given a set of generators for a coded 
system, let c n be the number of words of length n that appear either at the 
beginning or the end of some generator. 

Theorem B. Let (X,o~) be a coded shift and let c n be as above. 

(1) 7/hm n _j.oo ^logc n < ht p(X,a), then (X,a) is intrinsically ergodic. 

(2) If dinv^oo i logc n = 0, then every sub shift factor of (X,o~) is intrin- 
sically ergodic. 

Moreover, under these conditions, the unique measure of maximal entropy is 
the weak* limit of 5 -measures evenly distributed on periodic points of period 
at most n. 

A crucial ingredient in our approach, and a result of interest in its own 
right, is that under our conditions, the unique measure of maximal entropy 
satisfies a certain Gibbs property. We also give a sufficient condition for 
the unique measure of maximal entropy to be obtained as the weak limit of 
periodic orbit measures. 

We note that by expansivity, a subshift over a finite alphabet always has at 
least one measure of maximal entropy, so the main content of the theorem is 
uniqueness. There are many examples of non-intrinsically ergodic subshifts 
in the literature [DGS761 Pet86j . One can even construct minimal subshifts 
with arbitrarily many measures of maximal entropy [DGS761 Example 27.2]. 

Beyond the shifts with specification, various classes of shift spaces have 
been shown to be intrinsically ergodic, but none of these classes are closed 
under passing to factors. For example, a class of shifts called almost sofic 
was introduced by Petersen in [Pet86] . Many of these shifts are intrinsically 
ergodic, but not all, and Petersen gives an explicit example of an almost 
sofic shift that is intrinsically ergodic but which nevertheless has a subshift 
factor with more than one measure of maximal entropy. 

Thus the class of shifts that are both almost sofic and intrinsically ergodic 
is not closed under factors, and Petersen observed that "a useful class of 
almost sofic, intrinsically ergodic systems which contains the sofic systems 
and is closed under the usual dynamical operations such as passing to factors 
has not yet been identified." One merit of our approach is that we are able 
to describe a class of intrinsically ergodic systems which meets Petersen's 
criterion of being closed under passing to factors. 
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We compare our approach with existing techniques for proving intrinsic 
ergodicity beyond specification, focusing on the /3-shifts, although we em- 
phasise that our techniques and results work in a more general setting. Each 
value of P > 1 determines a subshift E^. The /3-shifts are a very natural 
and explicit family of shift spaces which, for generic values of ft, are not 
contained in the usual classes of shift spaces where standard techniques ap- 
ply (eg. SFTs, sofic shifts, shifts with specification). Interest in the /3-shift 
arises from its role as the coding space for the /3-transformation, which has 
a deep connection with number theory (see $3]). 

Intrinsic ergodicity for arbitrary /3-shifts was established independently 
by Hofbauer [Hof79] and by Walters [Wal78]. Hofbauer's approach relies 
on modeling the /3-shift by a countable state topological Markov chain with 
strong recurrence properties, and using a version of the Perron-Frobenius 
theorem to establish intrinsic ergodicity. The approach taken by Walters, on 
the other hand, applies transfer operator methods (related to the Perron- 
Frobenius approach above) directly to the /3-shift. 

Both of these approaches have been extensively generalised, and have 
proved very successful in a multitude of situations. However, it seems prob- 
lematic to adapt these methods to the operation of taking factors. We de- 
velop another approach, which does not use Perron-Frobenius theory, and 
which has more in common with Bowen's elegant proof that expansive maps 
with specification are intrinsically ergodic [Bow 74] . 

Given a shift space X, we write C for the language of X — that is, the 
collection of all finite words that appear in sequences In this context, 

specification is the ability to use a connecting word of a fixed length to glue 
together any two words v, w from the language of the shift space — that is, 
the existence of t £ N such that given any v,w € C, there is a word x £ C 
with length \x\ = t for which the concatenation vxw is once again in £.. 
For shifts with specification, Bowen's proof proceeds by using combinatorial 
arguments to establish a Gibbs property for a certain measure of maximal 
entropy, and then using this Gibbs property to prove uniqueness. 

For more general shifts, topological transitivity guarantees the existence 
of some x € C so that vxw 6 C, but x may be arbitrarily long; this is the 
case for generic /3-shifts. This necessitates a new approach to the estimates 
in Bowen's proof, which no longer hold in their original form. We overcome 
this difficulty by considering a collection of "good" words Q C C on which 
specification holds. We use the structure of to characterise the obstruc- 
tions that prevent a word from being good: the words that do not belong to 
Q are precisely those that end in a word taken from a certain smaller collec- 
tion of words C s . We are able to describe the collection C s very explicitly, 
and the growth rate of the number of words of length n in C s is subexpo- 
nential. This allows us to prove that a uniformly positive proportion of all 
the words of length n in the language are 'good', which in turn allows us to 
establish a weakened Gibbs property and prove uniqueness. 
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For our more general results, including S-gap shifts and general coded 
shifts, we pursue a similar strategy. Given a collection of words Q that 
satisfies the specification property, we characterise the words that do not 
belong to Q explicitly by a collection of their possible endings C s (suffixes) 
and a collection of their possible beginnings C p (prefixes). We require that 
every word can be extended to a word in Q in a suitably uniform manner. 
If the number of words of length n arising from the collections C s and C p 
grows more slowly than the topological entropy, then the shift is intrinsically 
ergodic. 

We mention that our work is in a similar spirit to Buzzi's work on shifts 
of quasi-finite type |Buz05] . of which the /3-shifts are a prime example. The 
key property he assumes is that the number of words that are "constraints" 
grows slower than the entropy. This property is philosopically similar to our 
requirement that a certain collection of words grows slower than the entropy. 
Nevertheless, our conditions do not seem to imply the quasi- finite type prop- 
erty, and definitely do not follow from it. Buzzi was able to show (among 
many other things) that q.f.t. shifts have finitely many ergodic measures of 
maximal entropy, and gave counter-examples to uniqueness. 

Asking for the specification property to hold only for words taken from a 
suitable proper subset of C is a key innovation in our approach, and provides 
the necessary flexibility to deal with shift spaces whose behaviour is a long 
way from being Markov. There has been a resurgence in interest in spec- 
ification properties recently, due to important contributions by Pfister and 
Sullivan (almost specification [PS07|, IYam09^ ITholOj ). and Varandas (non- 
uniform specification |VarlO| ), Our work is very much in the spirit of these 
developments, giving another direction in which to weaken the specification 
property in order to apply to a wider range of examples. 

In ^21 we collect our definitions and state our main result, a condition 
for intrinsic ergodicity, together with results on how this condition behaves 
under factors. In JjJ we discuss in detail the application to /3-shifts, <S*-gap 
shifts, and their factors, showing that Theorem |A] follows from our main 
results. In £JH we discuss coded systems and derive Theorem [Bj In §^M6l 
we prove our main results on intrinsic ergodicity, and their behaviour under 
factors. 



2. Definitions and statement of result 

A topological dynamical system is a compact metric space X together 
with a continuous map f:X—>X. Let Aif(X) denote the space of /- 
invariant probability measures on X. We write ht op (X,f) for the topolog- 
ical entropy of the dynamical system, and h^(f) for the measure-theoretic 
entropy of fj, G A4 f(X). The variational principle [Wal82, Theorem 8.6] 
states that 



ho P (XJ) = sup{V(/) | n e M f (X)}. 
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An invariant probability measure that attains this supremum is called a 
measure of maximal entropy (or mme for short). If such a measure exists 
and is unique, the system is called intrinsically ergodic. 

2.1. Languages for shifts. We begin by recalling the relationship between 
shift spaces and languages. We refer the reader to [BH86 , LM95] for further 
background and proofs. 

Fix an integer p > 2 and let {l,...,p} <N be the collection of all fi- 
nite words in the symbols 1, . . . ,p. Juxtaposition denotes concatenation — 
that is, given two words v = v i • • • v m and w = w\- ■ -w n , we write vw = 
Vl • ■■VmWi ■■■W n . 

A one-sided language C C {1, . . . ,£>} <N is a collection of words such that 

(1) if w € C and v is a subword of w, then v € £; 

(2) if w G £, then there exists a € {1, ... ,p} such that wa S C 

Given a one-sided language C, let X = Xc C be the collection of all 
sequences x\x^ ■ ■ ■ such that 

(2.1) XiXi + \ . . . Xj-iXj € C 

for every 1 < i < j < oo. Then X is a closed a- invariant set, where a is 
the usual shift operator defined on {1, . . . ,p} N ; this is the one-sided shift 
associated with the language C. 

The construction also runs in the converse direction: given a one-sided 
shift X C Sp , the set of all words that appear in sequences x € X is a one- 
sided language. This gives a one-to-one correspondence between one-sided 
shift spaces and one-sided languages. 

If £ is a one-sided language that satisfies the additional condition 

(3) if w G C, then there exists a € {1, ... ,p} such that aw G C, 

then we say that C is a two-sided language. Let X = Xc C S p be the 
collection of all doubly infinite sequences . . . X-iXqX\ . . . such that (12. lj) holds 
for every — oo<i<j<oo. Then Xc is a closed a- invariant set, where 
a is the usual shift operator defined on {1, . . . this is the two-sided 

shift associated with the language C, and is the natural extension of Xc- 
As with one-sided shifts, the correspondence runs both ways. The following 
well known proposition (whose proof we give in Section [6J] for completeness) 
shows that intrinsic ergodicity for Xc is equivalent to intrinsic ergodicity for 
Xc- 

Proposition 2.1. The invariant measures of Xc and Xc can be identified 
by a natural entropy preserving bijection. 

Given a one-sided language £, let C be the union of all subsets of C that 
satisfy ([3]). Then X^ = f] n>0 a n (Xc), and so X^ and Xc have the same 
space of invariant measures. Thus it suffices to consider two-sided languages. 

Let \w\ denote the length of a word w, and denote by C n the collection 
of all words of length n in £. For one-sided shifts, there is a one-to-one 
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correspondence between words w G £ and central cylinders 

[w] := {x G Xc \ xi = Wi for all 1 < i < \w\}. 

We have a similar correspondence between words and cylinders for two-sided 
shifts, with the caveat that we must keep track of where the cylinder begins: 
given w G C and k G Z, we define the cylinder 

k[w] := {x G X£ | = lOj for all 1 < i < 

We define the central cylinder for w G C n to be & [w] where = — |_n /2j . 

Given a collection of words V C £ and n > 1, let £> n = £> PI £ n be the 
set of words of length n in P. We denote the growth rate of the number of 
words of length n in T> by 

(2.2) h(V) = lim" - log#V n . 

n— >oo n 

The correspondence between words and cylinders implies that 

h(C) = ht op (Xc,a). 

Given collections of words A, B C we will write 

AB = {vw e C | v e A, w e B}. 

Note that only words in C are included in the concatenation AB. It may be 
the case that A and B are both non-empty, but a word in B cannot follow 
a word in A, and so AB = 0. 

We will occasionally write 0™ to denote the word that contains the symbol 
repeated n times. Usually, however, superscripts will denote indices — that 
is, we write w 1 , w 2 , . . . for a collection of words, so as to reserve the notation 
Wi for the ith entry of the word w. 

2.2. Specification properties. There are a variety of properties in the 
literature that go by the name "specification". They all have to do with 
the ability to approximate arbitrary orbit segments by a single trajectory. 
In the standard definition of specification due to Bowen, the time spent 
transitioning between orbit segments has a fixed length, independent of the 
length of the orbit segments. For symbolic spaces, this corresponds to being 
able to freely concatenate words using connecting words of fixed length. 
There are a number of variations on this definition, both in the classical and 
recent literature |Bow741 IDGS761 lPSu7l IVarTn] . 

We formulate specification properties that apply only to a subset of the 
space. Our definition applies to naturally defined subsets of many examples, 
such as /3-shifts, that do not have specification. 

Definition 2.1. Let C be a language (one- or two-sided) and consider a 
subset Q C C Fix t G N; either of the following conditions defines a speci- 
fication property on Q with gap size t. 

(S): For all m G N and w 1 ,... ,w m G Q, there exist v 1 ,... , v 171 ^ 1 G C 
such that x := w 1 v 1 w 2 v 2 ■ ■ ■v m ~ 1 w m G C and \v % \ =t for all i. 
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(Per): Condition (S) holds, and in addition, the cylinder [x] contains 
a periodic point of period exactly \x\ +t. 

When Q = C, we recover the usual specification property of Bowen. 

Remark. We stress that, in our definition, we only ask that x G C. We do 
not require that x G Q. 

Remark. A natural variant on this definition is to allow the connecting words 
v l in (S) to satisfy \v l \ <t for all i, rather than requiring the equality \v l \ = t. 
We refer to this as (W)-specification. Theorem [C] and Proposition 12.21 be- 
low still hold if (S)-specification is replaced with (W)-specification (one only 
needs to be a little more careful in the proof of Lemma l5.10p . while Proposi- 
tion [231 and Theorem ID1 require the stronger condition (S). In general, (Per) 
is stronger than (S), which is in turn stronger than (W); however, all three 
are equivalent in the case t = 0. 

2.3. Statement of Results. We consider languages C admitting a decom- 
position C = C P QC S — that is, there are collections of words C P ,G,C S C C 
such that every word in C can be written as a concatenation of a word from 
C p (a prefix), a word from Q (a "good" core), and a word from C s (a suffix), 
in that order (there may be more than one way to do this). 

Given such a decomposition, we define collections of words Q{M) for each 
M G N by 

g(M) = {uvw I u G C p ,v G G,w G C s , \u\ < M, \w\ < M}. 

Observe that Um>i £( m ) = J ~- 

Let Per(n) = {x G X \ a k {x) = x for some 1 < k < n} be the collection 
of periodic points of period at most n, and write 

(2 ' 3) Mn= #PerN £ 5x 

xGPer(n) 

for the probability measures evenly distributed across the points in Per(n). 
The following results apply equally to one-sided and two-sided shift spaces. 

Theorem C. Let (X, a) be a shift space whose language C admits a decom- 
position C = C P QC S , and suppose that the following conditions are satisfied: 

(I) G has (S)- specification. 
(II) h(C p UC s ) < h(C) = h top (X,a). 

(Ill) For every M G N, there exists r such that given v G G(M), there exist 
words u,w with \u\ < r, \w\ < r for which uvw G G- 

Then (X,a) is intrinsically ergodic. If G has (Per) -specification, then the 
sequence of probability measures (|2 . 3[) converges to the unique measure of 
maximal entropy. 

Remark. Condition OIB says that every word can be extended to a word in 
G, and that the length required to do so is controlled by the prefix length 
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and suffix length. Our proof will show that conditions ([!]) and ((TTJ) imply 

h{g) = h(c). 

Remark. The assumption that Q has (Per)-specification guarantees that the 
periodic orbit measures (j2.3|) are well defined. It is possible for a collection of 
words to satisfy (S)-specification without X containing any periodic orbits, 
in which case the measures in (I2.3D are not well defined. 

The following proposition tells us how the decomposition C = C P QC S and 
its properties behave under factors. 

Proposition 2.2. Let (X,a) be a shift factor of (X,o~), and denote the 
corresponding languages by C and C. If C admits a decomposition C = 
C P QC S , then C admits a decomposition C = C P QC S such that 

(1) If Q has (S)- specification, then Q has (S)- specification; 

(2) If Q has (Per) -specification, then Q has (Per) -specification; 

(3) h(C p UC s ) < h(C p UC s ). 

(4) If C- satisfies . then so does C. 

Combining Theorem ICl with Proposition 12.21 gives the following result. 

Corollary 2.3. Let (X, a) be a shift space whose language admits a de- 
composition C P QC S satisfying (QJj, 177]) and UII) . and let (X,o~) be a subshift 
factor of (X,o~) such that ht op (X,a) > h(C p L)C s ). Then X is intrinsically 
ergodic. If Q has (Per) -specification, then the sequence of probability mea- 
sures (|2.3p converges to the unique measure of maximal entropy for (X,a). 

There are a number of important examples for which the collection of 
prefixes and suffixes grows sub exponentially. Furthermore, we have the fol- 
lowing dichotomy for systems satisfying Conditions (jl]) and 0111) . 

Proposition 2.4. Let (X, a) be a shift space whose language has a decom- 
position C = C P QC S satisfying (jTJ) and (jllip . Then either X has positive 
entropy or X comprises a single periodic orbit. 

Since Conditions (jj) and are preserved by factors (Proposition I2.2|) , 
we obtain the following result, which gives a broad class of intrinsically 
ergodic systems that is closed under taking factors. 

Theorem D. Let (X,o~) be a shift space whose language admits a decom- 
position C P QC S satisfying conditions (jl]), (jlllj) and 

(ii) h(c p uc s ) = o. 

Then every subshift factor of(X,o~) is intrinsically ergodic. If Q has (Per)- 
specification, then the sequence of probability measures ()2.3p converges to 
the unique measure of maximal entropy for (X,o~). 

In Section [3l we will show that every /3-shift and S-gap shift has a language 
with a decomposition satisfying the conditions of Theorem [Dl which proves 
Theorem [A] 
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3. Application to /3-shifts, S-gap shifts, and their factors 

3.1. /3-shifts. We recall some facts about /3-shifts; further information can 
be found in [ParfiOl l,Toh99l IPSD71 IMai071 ITho051 ITholOj , among others. For 
any /3-shift, we describe a decomposition of the language which satisfies the 
hypotheses of Theorem |D| whence the first part of Theorem lAl follows. 

Fix a real number /3 > 1, and let b = [~/3] be the smallest integer greater 
than or equal to /3. The /3-shift C {0, . . . , 6} N is the natural symbolic 
space associated to the /3-transformation fp : [0, 1) i->- [0, 1) given by 

fp(x) = [3x (mod 1). 

There is a uniquely determined sequence w(/3) = (vjj([3))'jL 1 that is the 
lexicographic supremum over all solutions to the equation 

oo 

J>,(/3)/3- J = l. 

The /3-shift can be characterised by 

i£E ? ^ a k (x) * w(J3) for all k > 1, 

where ^ denotes the lexicographic ordering on {0, ...,6} N . In particular, 
for every k, a k (w((3)) X w(/3). 

A /3-shift is sofic if and only if /3 is eventually periodic, and has specifica- 
tion if and only if w(f3) does not contain arbitrarily long strings of zeroes; 
the set of /3 with this property has Lebesgue measure zero. Generically, 
then, a /3-shift is not sofic and does not possess the specification property 
[BM86llSch97] . 




Figure 1. A graph presentation of a /3-shift. 

Every /3-shift can be presented by a countable state directed labeled graph 
r^, which is shown in Figure 1 (see also [BH86t IPS071 ITholOj ) . We describe 
the construction of this graph, assuming that w(j3) is not eventually periodic. 
Let vi, «2, . . . be a countable set of vertices. We use the following two rules 
to add edges to this graph. Firstly, for every i > 1, we draw a directed edge 
from Vi to Vi + i and label it with the value Wi{j3). Secondly, if Wi((3) > 1, 
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then for every j G {0, 1, ... , Wi((3) — 1}, we draw a directed edge from Vi to 
v\ labeled with the value j. 

Note that if Wi((3) = 0, then the only edge which starts at vi is the edge 
from Vi to Vi + \ labeled by 0, and if Wi(f3) > then there is at least one 
edge from V{ to v\. The graph Tp characterises as follows: a sequence x 
belongs to if and only if x labels an infinite path of directed edges in Yp 
that starts at the vertex v\. 

It follows that words in the language C correspond to finite paths in the 
graph starting at v\. Let Q be the collection of words for which the 
corresponding path also ends at v\, and let C s be the collection of words for 
which the corresponding path never returns to v\. Let C p = 0. We can see 
from the graph that 

C = { Wl (f3)---w n ((3) |n>l}. 

This is because the only finite paths that never return to v\ are those which 
visit the vertices vi, V2, i>3, . . . in that order. 

Every path can be decomposed into a part that ends at v\ followed by a 
part that does not return, and so this gives us a decomposition C = C P QC S . 

Loops based at v\ can be freely concatenated and each such loop cor- 
responds to a periodic orbit, hence Q has (Per)-specification with t = 0. 
Furthermore, we have 

Q(M) = {vw \v G G,w G C s , \w\ < M}, 

and by taking r to be such that every path of length < M that begins at v\ 
can return to v% within r steps, we see that Condition (IHip is satisfied. 

Furthermore, = 1 for all n > 1, and so Condition (JXTJ ) holds. Thus 
every /3-shift satisfies the conditions of Theorem [Dl and this proves the part 
of Theorem lAl concerning /3-shifts. 

3.2. S"-gap shifts. An S'-gap shift is a subshift of {0, 1} Z defined by the 
rule that for a fixed S C {0, 1, 2, . . .}, the number of 0's between consecutive 
l's is an integer in S. More precisely, the language of is 

{0 n 10 ni 10 n2 l • • • 10 nfe 10 m | m G S for all 1 < i < k and n,m G N}, 

together with {0 n [ n G N}, where we assume that S is infinite. (If S is 
finite, then Eg is sofic.) The entropy of the S'-gap shift is log A, where A is 
the unique solution to 1 = X^neS 2 " ™ 1 - (^ ee !Wei70| or [LM951 Exercise 
4.3.7].) 

The language for Eg admits the following decomposition: 

Q = {0 ni 10 n2 l • • • 10 nfc l | m G S for all 1 < i < k}, 
CP = {0 n l \n(£S}, 
C s = {0 n \ n G N}. 

It follows immediately that Q has (Per)-specification with t = 0. Condition 
(ITU ) follows from the observation that #Cn < 1 and = 1 for every n. 
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It is not hard to see that Condition (jllip holds. (See also the discussion 
in Section HI) Applying Theorem iDl proves the remainder of Theorem El 



Remark. If every element of S is odd, then every periodic orbit in the cor- 
responding S'-gap shift has even period. This demonstrates that in the defi- 
nition of the periodic orbit measures (|2.3p . it is crucial that Per(n) denotes 
the periodic orbit measures of period at most n, rather than those of period 
exactly n. 



3.3. Specification properties for S-gap shifts. Specification and almost 
specification properties can be seen to fail for a generic S-gap shift. For 
example, let S = {2 n \ n G N}. Consider the words u = 10* and w = (PI. 
Suppose i + j = 2 n — k > 2 n ~ 1 . The shortest word v such that uvw is 
admissible is v = k . Choosing n and k large enough clearly shows that 
specification fails. Similarly, specification fails for any S-gap shift for which 
S has unbounded gaps (i.e. {nj + i — m \ Hi G S} is unbounded). 

We give the definition of almost specification and give a similar example 
which shows that almost specification fails. 

We say a non-decreasing function g : N i— > N is a mistake function if g(n) < 
n for all n and g(n) /n — > 0. We say a symbolic space has almost specification 
if there exists a mistake function g such that for every w , . . . , w n € C, there 
exist words v , . . . , v n G C with \v l \ = \w l \ such that v 1 v 2 ■ ■ ■ v n G C and each 
v l differs from w l in at most «7(|i>*|) places. This is a special case, adapted 
to symbolic dynamics, of the definition that appears in |PS07j. [TholOj . 

The space of shifts with almost specification is closed under factors, and 
every /3-shift has almost specification with the mistake function g(n) = 1 
(see [TholOj for details). It is an open question whether every shift with 
almost specification is intrinsically ergodic. We demonstrate that Theorem 
O applies to examples without the almost specification property by giving 
an example of an S'-gap shift where almost specification fails. 

Consider the S'-gap shift where S = {(2n)! j n G N}. Let g(n) be any 
mistake function. Let n be sufficiently large so that 2(g(2n) + 1)) < In. Let 
k = (2n)! and let u = \g(k)+i^-g(k)-\ ; and v = k- g (k)-i ig {k)+i_ Observe 

that both u and v are in C. Consider the word uv. There is no way to 
change uv into the constant sequence n . The number of 0's we can change 
to a 1 is bounded above by 2g(k). Each string of 0's we obtain must have a 
length belonging to S. Suppose our modified word has £ strings of zeroes of 
length m, . . . ,ri£ G S. Each of these lengths is bounded above by (2(n — 1))!, 

and £ < 2g(k) + 1. Thus, ^Li n i < 2 (f ( k ) + !)( 2 ( n ~ !)) ! < 2n(2(n - 1))!. 
There are at most ^g(k) + 2 < 8n + 2 entries of 1. Thus, if we were able 
to modify uv into an admissible word using 2g(k) mistakes, then its length 
would be strictly less than 2(2ra)! which is a contradiction. 
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4. Application to coded systems 

A shift space is coded if its language C is freely generated by a countable 
set of words — that is, if there exist words {u> n }„ e N C C such that 

C = {w n iw n z ■ ■ ■ W n k | 7li i N}, 

where {•} denotes closure under the operation of passing to subwords. We 
refer to the words w n as the generators of the coded system (X, a). For 
more information, see (BH861 [FF92J. 

The language of a coded system has a natural decomposition C = C P QC S 
for which Q has (Per)-specification. Namely, we may consider 

g = { w ni ---w nk | n { e N}, 

the collection of all concatenations of generators (note that here we do not 
allow passing to subwords), together with 

C p = {i s k {w n ) |n£N,l<fc< \w n \}, 
C s = {i p k {w n ) | n 6 N,l < k < \w n \}. 

That is, C p is the collection of all suffixes of generators, and C s is the collec- 
tion of all prefixes of generators. Conditions (jl]) and hold for much the 
same reasons as they did for the 5"-gap shifts. Indeed, it is immediate that 
Q has (Per)-specification with t = 0, so Condition (jl]) holds. 

For Condition . we observe that given wgC, there exists a generator 
w such that u = if Aw). Let t p (u) be the minimum value of \w\ over all 
such generators. Given u G C s , define t s (u) similarly, as the minimum value 
of \w\ over all generators w such that u is a prefix of w. 

Given M € N, define r s (M) and t p (M) by 

t s {M) = max{r s (ii) | u € C p , |u| < M}, 
r p (M) = max{r p (u) \u£C s , \u\ < M}. 

Let t(M) = r p (M) + r s (M); then given any word v € Q{M), there exist 
words x l ,x 2 with | a; 1 1 < r s (M) and \x 2 \ < t p (M) such that x 1 vx 2 € 
This proves Condition 

The number c n defined before Theorem [Bl is nothing but #(C p UC s ) n and 
so linin^oo - log c n = h(C p L)C s ). Thus, Theorem [Bl follows from Theorem ICl 
and Theorem |Pl We remark that our results on /3-shifts, 5"-gap shifts, and 
their factors (Theorem [A]) are a special case of Theorem [Bl 



5. Proof of Theorem O 

5.1. Uniform estimates on numbers of words. We obtain estimates on 
the growth rates of and i^Qn- The estimates in this section require 
only conditions (jl]) and (JTTJ) : condition (IHip will not be used until we prove 
the Gibbs property in Section 15.31 The following lemma is a special case of 
KH951 Lemma 18.5.3] or [Bow741 Lemma 2-3]. 
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Lemma 5.1. For every n, 

(5.1) #£„ > e nh( ~ c \ 

Proof. It is straightforward to obtain #£ m +n < (#£m)(#Ai), which yields 
#C kn < (#C n ) k , and upon taking logarithms, ^ log #£fc n < ^ log i^^n- 
Passing to the limit as A; — >• oo gives the result. □ 

Using condition ([I]), we can obtain an upper bound on f^Q n . 

Lemma 5.2. There exists C\ > smc/i that for all n, 

(5.2) #Qn<C ie nh ^\ 

Proof. Condition ((J) immediately implies that #£fc(n+t) > (#G n ) k , which 
gives 

^^log#A(n + .)>^#0, 

Sending to infinity, we obtain #G n < e^ n+t ' h ^' . □ 

This leads to an upper bound on #£ n by using the decomposition C = 
C P QC S together with the bound on C p and C s given in condition ([II)) . 



Lemma 5.3. There exists C2 > such that for every n, 

(5.3) #C n <C 2 e nh ^. 

Proof. Fix e > such that h(C p UC S ) < h(C) - e. Then there exists a 
constant C3 such that 

(5.4) #(C P U C") < C 3 e™(M£)-e) 

for every n. For every word x € £ n there are non-negative integers 
that sum to n and for which x can be decomposed as uvw, where u € Cf, 
v 6 and iy £ C|. Thus 

i+j+k=n 

< Cl Ci £ e< 



i+j+k=n 
n m 

me 



m=0 i=0 

(m + l)e 



m>0 

The sum converges and is independent of n, which completes the proof. □ 

In place of a lower bound for every #G n , which is not possible, we obtain 
the following estimate. 
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Lemma 5.4. There exist constants C4 > and N G N such that for every 
n G N, there exists I with n — N < £ < n such that 

(5.5) #Q t > C^ c \ 

Proof. Given j G N, write aj = jf-Qje^^ 1 -^ . Choose e > and C3 so 
that (|5.4p holds, as in the proof of Lemma 15.31 A similar calculation to the 
one there gives 

e nh ^<#C n < £ (#Cf)(#^)(#C|) 
This implies that 

< ^(m+l)e- me a n _ m . 

m=0 

Let iV be large enough such that C5 := C3" 2 — ^ m >Af( m + l)e~ me Ci > 0, 
and let C% = max{(m + l)e~ me | m G N}. Then we have 



\m=0 / \m>Af 



which yields 

AT-l 

Cfe ^ On-m > C5. 
m=0 

It follows that there exists n — N < £ < n such that ae > C§/ {NC§). □ 
We use a similar argument to obtain the following estimate for ( K M^j n . 
Lemma 5.5. For all 5 > 0, there exists M = M{5) G N such that for all n, 

(5.6) *§^>1-*. 

Proof. Let e > and C3 be such that (|5.4|) holds, and choose M so that 
Wf£ m >M(™ + l)e- m£ <<5- 

Once again, we use the decomposition property of C to write any word 
in C n as raw, where u G Cf , t> G and u> G C|, with i + j + k = n. If 
i + A; < M, then uvw G Q(M) n , and it follows that 



#£n<#£(M) n + (#cf)(#^)(#C, 



ft;- 



i+k>M 
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Now using (fO|) . ([S3]), and (pTTj) . we have 

#£n < #£(M) n + CiCf ^ e (i+fc)(M£)- £ ) e iM^) 

< #<?(M) n + CiCf (#£ n ) (m + l)e" m£ 

m>M 

<#g(M) n + 5{#Cn). □ 

An immediate consequence of Lemma [53] is that every collection of words 
that grows quickly enough has arbitrarily large intersection with Q(M), in 
the following sense. 

Lemma 5.6. Suppose V C C and C-j > are such that 

(5.7) #X» n > C 7 e nHC) 

for every n. Then there exist constants M € N and C$ > suc/i £/iai /or 
ewery n € N, 

(5.8) #(v n ng(M) n ) >C s e nh{c) . 

Proof. Let 5 > be sufficiently small that Cs := C 7 — 6C% > 0, and let 
M = M{8) be given by Lemma EH Then liOjl . (l5~3jl . and ((§21) yield 

#(p n n g(M) n ) > #v n - #(v n \ g(M) n ) 

> c 7 e nh ^ - #(c n \ g{M) n ) 

> c 7 e nh ^ - 5#C n 

> C 7 e nh{c) - 5C 2 e nh{c) = C 8 e nh{c) . □ 

5.2. Collections of cylinders with uniformly positive measure. For 

a collection of words T> and a measure v, we abuse notation slightly and 
write v{T> n ) = v (U w eD n M) ; where [w] is the central cylinder defined by w. 
We also write v(w) in place of ^([w]) where it will not cause confusion. 

Lemma 5.7. For all 7 £ (0, 1) there exists Cg > such that if 1/ is a 
measure of maximal entropy, n € N, and T> n C C n has v{T> n ) > 7, i/ten 

(5.9) #D n > C 9 e nh ( £ ). 

Proof. Recall from the definition of measure-theoretic entropy that 

h v (a) = lim -H v {A n ) = inf -H u {A n ), 

n-s-oo n n>l n 

where A n is the partition of X into n-cylinders, and 
^(^ n ) = -Kw)logv{w). 

W<=C n 
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Because h u (cr) = ht op (XjC,a) = h(C), this yields the following inequality for 
every n (we write 2?£ for the complement of T> n in C n ): 

nh(C) < —v{w)\ogv{w) 
weCn 

= ~ v(w) log v{w) + ~ v{w) log v(w). 
Normalising each sum yields 
nh(C) < v{V. 

(5.10) 

+ u(v: 

+ (-!/(©„) log " v{V C n )\ogv(p C n )). 

Recall that for any non-negative numbers ai, . . . aj. summing to 1, we have 

k 

^^-ailogai < log A;. 

i=i 

We apply this to the first sum in (|5.10p with the quantities replaced by 
tf^K , to the second sum with a» replaced by *ffi e \ , and to the last line with 
a\ = v(V n ) and a 2 = u(T>^). This yields 

nh(C) < v(V n ) log #V n + v(Vl) log #(V c n ) + log 2, 
Lemma [O] implies that #(X£) < #£„ < C 2 e nh( - C \ and so we have 
nh{C) < v(V n ) log #V n + (1 - v{V n )){log C 2 + n/i(£)) + log 2 

= i/(X> n ) log#P n + log(2C 2 ) + nh(C) - u(V n ){\ogC 2 + nh{C)). 
Rearranging and using the assumption that u(T> n ) > 7, this gives 
u(V n )log#V n > v(V n ){\ogC 2 + nh(C)) - log(2C 2 ), 

log #V n > log C 2 + nh(C) - l ° g ^ 

> log C 2 + nh(£)- 7- 1 log(2C 2 ), 
and exponentiating both sides yields (I5.9p , □ 



Lemma 5.8. For a// 7 G (0, 1) i/iere exists C\q > and M € N suc/i i/iai 
if v is a measure of maximal entropy, n E N, anc? P n C £ n /ias v{T> n ) > 7, 
i/ien 

(5.11) #(Pnne(M))>C 10 e n ^. 

Proof. This follows from Lemma 15.71 and Lemma 15.61 □ 
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5.3. A Gibbs property. Wc build a measure of maximal entropy fj> as a 
limit of (5-measures fi n evenly distributed across n-orbits. In particular, for 
every n we choose a finite set of points E n such that for every word w G C n , 
the central cylinder [w] contains exactly one element of E n . Consider the 
measures v n defined by 



^ n X(£E n 



In order to obtain invariant measures, we consider the measures 

n-l 



(5.12) A*n : =~I> 



IS Y) 



n 

k=0 



and let ^ be a weak* limit of the sequence {// n }- 
Lemma 5.9. fx is a measure of maximal entropy. 

Proof. This is proved in the second part of |Wal82l Theorem 8.6]. □ 

We prove a Gibbs property for the measure of cylinders corresponding to 
words in Q. 

Lemma 5.10. There exists C\\ > such that for every n G N and w G Q n , 

we have 

(5.13) M (H) >C u e- nh{C) . 

Proof. By Lemma 15.41 we have N G N, C4 > 0, and a sequence rij /* 00 
with Uj + \ — n,j < N such that for all j, 

(5.14) #Q nj > C A e n ^ c \ 

Consider w G Q n . We estimate fi m {[w\) for large m by estimating v m (a~ k ([w])) 
first. Let t G N be provided by condition (jl]). Fix k < m. If k — t < N, let 
^1 = 0. Otherwise, let ^1 G {n^} and satisfy k — t — N < l\ < k — t. Let 
^2 G {%} satisfy m — k — t — n — N < £2 < m — k — t — n. lim — k — t — n < 0, 
let ^ 2 = 0. 

First assume ^1,^2 > 0. It follows from condition that for every G 
Ge-i and G Ge 2 there exist words u l ,u 2 G £ with = t such that 
x := v 1 u 1 wu 2 v 2 G £. Extending x by at most ./V symbols at each end, we 
obtain a word y G C m . Different choices of v 1 and v 2 give different words y, 
which shows that 

Vm{0- ([W\)) > — . 

Tr>~"m 

If £j = 0, this formula still holds by setting #{7^ = 1. 

Since li G {nj}, we may use (|5.14j) and Lemma I5T31 to obtain 

u m (a- k (H)) > C%CZ 1 e& +t *MQe- mh W 
> C 2 C^e^ N+ ^ h{c h~ nhi - c \ 
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Writing C u = C\C 2 l e _2(A I+t ) h ( c ) and applying this to (EU2l) gives 

Pm(M) > C u e- nh( ~ c \ 
and passing to the limit gives f|5. 13j) . □ 



Up to this point, we have not used Condition (jllip at all. From now on 
we will use this condition as well, which will allow us to extend the Gibbs 
property in Lemma 15.101 to cylinders corresponding to words in Q(M), with 
the caveat that the constant in the Gibbs property decays as M — > oo. 

Lemma 5.11. For every M G N, there exists a constant Km > such that 
for every n G N and w G Q(M) n , we have 

(5.15) MM) > K M e- nh ^. 

Proof. Fix M G N and let r be given by condition . Then given w G 
Q(M) n , there exist words u,v with |u| < r, |u| < r, so that miw G £?. Since 
C ct _ I u I[u>], Lemma 15.101 gives 

Setting K M = Cue' 27 ' 1 ^ gives flgjg) . □ 

Finally, we observe that there is a uniform upper bound for the //-measure 
of an n-cylinder. 

Lemma 5.12. There exists a constant C\ 2 > such that for every n G N 
and w G £ n , we Ziaue 

(5.16) //(H) < Ci 2 e- n/l(£) . 

Proof. Fix m> n and & < m — n. By Lemmas 15.31 and 15. 1| we have 

-fc/r m / (#A0(#£m-fc-n) 



M^~ fe (M)) < 



It follows that // m (H) < C|e n M-£). Passing to the limit as m -> oo, we 
obtain (|5~16jl with d 2 = C|. □ 

Recall that given a set of words V n C C n , we write /i(P n ) = MUu>ex> n 

Lemma 5.13. Zei <$i > 0. There exists M so that for all n, any subset 
T> n C C n satisfies 

n(v n ng(M))>fi(v n )-5i. 

Proof. Let 5 = (C^C^) -1 ^. Lemma 15.51 provides M G N such that for all 

Tt 

#G{M) c n = #(C n \ Q(M) n ) < 5#C n . 
Combining this with Lemma 15.121 and Lemma 15.31 gives 

KG(M) c n ) < 5#£ n C 12 e- nh ^ < SC 2 C 12 = S u 
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and we have 

fi{v n ) = /x(v n n g{M)) + fi(v n n g(M) c ) 

< ^{v n n g{M)) + n{Q(M) c n ) < fi(v n n g{M)) + 5 U 

as required. □ 

5.4. Proof that fj, is ergodic. We need to show that the measure [i is 
ergodic. This is a direct consequence of the following result. 

Proposition 5.14. If two measurable sets P,Q C X both have positive 
^-measure, then linin^oo //(P n a~ n (Q)) > 0. 

Proof. We begin by considering the case where P and Q are cylinders cor- 
responding to words in Q. 

Lemma 5.15. There exists C13 > and rrij — > 00 such that if u,v € Q, 
then for all sufficiently large j, 

(5.17) M(Hn^'H) > dsMMMM)- 

Proof. As in the proof of Lemma l5.10^ we use Lemma 15.41 to obtain JV£N, 
C4 > 0, and a sequence nj f~ 00 with n-j+i — rij < N such that (|5.14j) holds 
for all j. Let rrij = nj + 2t. 

Consider u,v € Q. Let m € N be large and fix k < m. We estimate 

u m (a- k [u]na^ k+m ^[v])). 

By a similar argument to Lemma l5.10( we obtain 



v m {o-~ k [u] D a~^ k+m ^[v])) > 



where £\ = for some i\ and satisfies k — t — N < £\ < k — t (or is if no 
such number exists), and similarly, £2 = n i 2 an d satisfies 

m — 3t — rrij — \u\ — \v\ — N < £2 < m — 3t — rrij — \u\ — \v\, 

(or if no such £2 exists). Using Lemmas 15.41 15.31 and 15.121 we obtain 

v rn (o-- k [u] n<j- (fc+m ^H)) > ciC2e {il+i2+n > )hic) e- mh{c) 

> clC2e~ 2Nh{c) e~ {lul+M)h{c) 

> CtC 2 e- 2Nh ^C^f,([u}U[v}). 

Writing C13 := ClC 2 e~ 2Nh ^ C{ 2 , this yields 

M m (Mn ff -^H))>c 13 /i(MHH), 

and passing to the limit as m — > 00 gives the required estimate. □ 
This result immediately extends to unions of cylinders from Q n . 
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Lemma 5.16. Let C13 > and rrij — > 00 be as in Lemma \5.15[ and consider 
P C Q n , Q C G n > ■ Let [P] = \J w€ p[w], and similarly for Q. Then for all 
sufficiently large j, 

(5.18) n([P] n a~ m * [Q]) > C 13 M([P])/i([Q]). 
Proof. This is a straightforward computation. 

M ([P] n [Q]) = m(M n ^" mj 

toGP 

> £ c 13y u(MMKD = c 13M ([p]) m ([q]). □ 

Using Condition (lllip . this result generalises to unions of cylinders from 
G{M) n . As before, given P C £ n , we write n(P) = fi([P]) = M(UwepH)- 

Lemma 5.17. Given M G N, i/iere exists a constant K' M such that for 
every P C Q(M) n and Q C Q{M) n i , we have 

(5.19) lhn ^(P n a" m (Q)) > K' Mf i(P)n(Q). 

m— >oo 

Proof. For each n; G Q(M), using Condition . we can choose x(w),y(w) G 
£ such that |ar(ty) | < r, < r, and x(w)wy(w) G £/. Given < 

i,j,i',f < r, let 

P(i,i) = {w e P I i = 1^(10)1, j = |y(u>)|}, 

Q(i , ,/) = KEQ|i , = |x(«/)|,j' = |y(«/)|}. 
There exist such that 

#P(i,j) > (r + l)- 2 #P, 
#Q(i',/) > (r + l)- 2 #g. 

Now let 

P = {x(w)wy(w) I u> G P(i,j)} C ^n+i+i) 
Q = { x (w')w'y{w') I w/ G /)} C £/ n > +i / + j>. 
We can estimate the left-hand side of ()5.19p by observing that 

lhn / u(Pna-( m+i ' + ^(Q)) > lhn ^n^W)) > d 3 /i(P)//(Q), 

m— >oo m— >oo 

where the second inequality follows from Lemma 15.161 Furthermore, the 
Gibbs properties (15.13h and fj5 .161) imply that 

M(P) > #P(i,j)Cue-( n+i+ » h W 

> ( T + l)- 2 #PCne- (l+J ' )/l(£) e- nh(£) 

>(T + i)~ 2 c lie - 2 ^c 12 V(p), 

A similar estimate on fi(Q) suffices to complete the proof. □ 
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Lemma 15.171 is the key tool in the proof of the following lemma. 

Lemma 5.18. Suppose 8\ > and M are such that Lemma \5.13\ holds 
and let K' M be as in Lemma \5.11\ Then for every pair of measurable sets 
P, Q C X , we have 

(5.20) Tim" /i(P n <7-*(Q)) > K' M (ji(P) - - <5i). 

n— yoo 

Proof. Fix e > and choose sets U, V that are unions of cylinders of the 
same length and for which fj,(U A P) < e and fj,(V A Q) < e. Let U' C U 
be the union of all cylinders in U corresponding to words in G(M), and 
similarly for V C V. By Lemma 15.131 we have n(U') > (i(U) — 6\ and 
n(V) > fJ,(V) — Si, and furthermore, by Lemma f5. 171 

(5.21) Ihn" fj,(U' n o-- n (V')) > K' M n(U')fi(V'). 

n— >oo 

We have U n o-- n {V) D U' n ^""(V'), and so 

(5.22) ImT n cr -ri (F)) > K' M {fi{U) - S^V) - Si). 

n— >oo 

We also observe that 

\n(u n a~ n (V)) - n{P n a~ n (Q))\ < n((U n <J- n {V)) A (p n <r- n (Q))) 

< /i((?7 A P) n a- n (y A Q)) < e 

for every n, which together with ()5.22j) implies 

hm fi(P n a- n {Q)) > K' M (ji{P) - 6i)(n(Q) - Si) - e. 

n— >oo 

Since e > was arbitrary, (|5.20p follows. □ 

Now let P, Q C X be any measurable sets with positive ^-measure. For 
sufficiently small Si > 0, the right hand side of (|5.20p is positive, which 
completes the proof of Proposition 15. 141 □ 

5.5. Contradiction if there is another mme. Let /i be the ergodic mme 
constructed in the previous sections, and suppose that some ergodic measure 
v _L n is such that h u (a) = htop(Xc,a) = h(C). Let V be a collection of 
words such that v(V n ) — > 1 and fi(T> n ) — > 0. Applying Lemma ESI we see 
that there are constants Cio > and M £ N such that 

#(2\ n G(M)) > C w e nh ^ 

for every n. Now we use the Gibbs property (|5.15p to observe that 

li(V n ) > fi(V n n Q{M)) > K M e- nh ^#(V n n Q{M)) > K M C W > 0, 

which contradicts the fact that n(T> n ) — > 0. This contradiction implies that 
any mme v is absolutely continuous with respect to fi, and since fi is ergodic, 
this in turn implies that v = fi, which completes the proof of the theorem. 
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5.6. Characterisation of the unique mme. We prove the final statement 
in Theorem O under the assumption that Q satisfies (Per)-specification. By 
Lemma 15.41 there exist constants C4 > and N E N such that for every n, 
there exists n — N < I < n for which 

#Qt > cV M£) . 

By (Per)-specification, every word w E Qt determines a periodic orbit of 
length I + t < n + t, and so we have 

#Per(n + t) > C 4 e ih ^ > C 4 e^- N ^ £ \ 

This in turn yields 

■ log # Per(n + t) > — ^ + h(C), 



n + t n +t n+t 

and so lim n _ ) . 00 ^log#Per(n) = h(C). Standard arguments such as those 
in the proof of [Wal82[ Theorem 8.6] show that any limit measure v of the 
sequence /j, n in (I2.3[> has h u (a) = h(C). Since we showed that fi is the unique 
measure of maximal entropy, this shows that the sequence /i n converges to 

6. Proofs of other technical results 

6.1. Proof of Proposition 12.11 Let e denote the empty word, so ew = 
we = w for every w E C, and [e] = X^. A measure /u induces a function 
m: C — >■ [0,oo) by m(w) = n([w]), and this gives a one-to-one correspon- 
dence between M a (Xc) and functions m: £ — > [0, 1] satisfying 

(1) m(e) = l; 

(2) for every w E C we have m(w) = J2a=i m {wa) = Yla=i fn(aw). 
Because, by cr-invariance, the starting point of the cylinder makes no 

difference to the measure, there is also a one-to-one correspondence between 
A4 a (Xc) and functions m satisfying the conditions above. This shows that 
the invariant measures of Xc and Xc can be identified. Furthermore, the 
entropy of \i is determined by m, so this identification preserves entropy. 

6.2. Proof of Proposition 12.21 We study the behaviour of the decom- 
position C = C P QC S and its properties under factors, proving Proposition 



In the following, we sometimes write v ■ w in place of vw to denote con- 
catenation. Let S C Hp and S C Sp be arbitrary closed two-sided invariant 
subshifts (the one-sided case is similar), and suppose that E is a topological 
factor of £ — that is, there exists a continuous and surjective map tt: £ 1— > S 
such that a o tt = n o a. By the fundamental result of Curtis-Lyndon- 
Hedlund }LM95l Theorem 6.29], tt is a block code: there exist k E N and 
4>: C2k+i — >• {1, • • • ,P} such that 

(irx) n = (p(x 
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This induces a map : C n +2k — > £-n by 

• • • W n+2 k) = <f>(.Wl ■ ■ ■ W 2k+ l)(j)(w 2 ■ ■ ■ W 2k+2 ) ■ ■ ■ <P{Wn ' ' ' W n+2k ). 

The map $ : C — >• C has the following important properties: 

(1) <3? is surjective. 

(2) For every word w £ C and x G n[n>], we have 7r(x) G n+k [^(w)]. 

If A; = 0, then $ is a homomorphism in the sense that §>(vw) = <&(v)$>(w) 
for all words v, w G C. For k > 0, we need to define maps on £ that extract 
prefixes and suffixes: in particularly, consider maps ifcjifc' C->k — ^ ^fc given 

by 

=w%...w k , 

i s k (w) = W\ w \_ k+ i . ..W\ w \. 

Now <£> has the property that for every v,w G C, we have 

*(wo) = $(v)<f>(i s 2k (v) • 4(w))*H 
(6.1) =*(«)*(»5 A (u).u;) 

Given a decomposition C = C P QC S , the obvious thing to do is to define 
a decomposition of C by applying <3? to each of C p , Q, and C s . Because 
the homomorphism property of $ takes the form (I6.ip . we must alter this 
slightly and define subsets of C by 

& = *(c ■ 4(g)), 
c s = $fe(g)-c s ). 

Given u G C p , v E Q, and w 6 C s , we see from (16. lj) that 

$H = $(uw)$(i§ fc (u) • w) = *(« • i2 fc (v))*(«)*(*2*(t0 • w) G C P £C S . 

This gives the decomposition of C claimed in Proposition ^. 2i Furthermore, 
we observe that for every M, we have 

$(0(Af)) = {$(ot) I u G C p ,u G </,«; G C s , \u\ < M, \w\ < M} 

u G C p , v G G, w G C s , |u| < M, |w| < M} 

= G(M). 

Now suppose has (S)-specification. Then given w , . . . , w n G Q, we have 
w 3 = ^{y 3 ) for some u J G G, and by (jj) there exist ar, ... ,x n ~ l G Ct such 
that 

o 1 iW-i n - 1 » n G C. 
Applying $ and writing y 3 = <&{i s 2k {v 3 ) ■ x 3 ■ i 2k (v > we have 
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Thus Q has (S)-specification with connecting words of length t + 2k. Since n 
takes periodic orbits to periodic orbits, (Per)-specification is also preserved 
by $. 

Condition is clearly preserved by <3?. If Q{M) satisfies Condition (jlllj) 
then for every v G Q(M), there exists u, v with \u\ <t + 2k and \w\ < t + 2A; 
such that uvw G Q. 

Finally, we observe that jfCn < (#£2fc)(#Cn), and similarly for C s , which 
completes the proof of Proposition 12.21 

6.3. Proof of Proposition 12.41 We show that (S)-specification implies 
positive entropy unless the collection Q has a very specific structure. 

Lemma 6.1. Let (X,a) be a shift space whose language C contains a collec- 
tion of words Q C C with (S)-specification, and suppose that ht op (X,a) = 0. 
Then given any v,w G Q, we have vCtw R wCtv ^ 0; that is, there exist 
y,z G Ct such that vyw = wzv. 

Proof. Fix v ^ w € Q, and suppose vCtwdwCtv = 0. We will conclude that 
htop(X,cr) > 0, which suffices to prove the lemma. 

Given N G N, consider £ G {1,2}^. Using (S)-specification, there exists 
cc(£) G C having the form 

a(Z)=a 1 b 1 a 2 b 2 ---b N - 1 a N , 

where b l G Ct, an d where 

i JvCtw i = 1, 

1 r* 

I w£tf i = 2. 

Because vCtw R wCtv = 0, the map a is injective. Furthermore, writing 
m = \v\ and n = |to|, each word a(£) has length iV(m + n + 2t) — t, and it 
follows that 

— ■ — — log #£N{m+n+2t)-t > T77 : -TTT[ 7 1°S 2^- 

TV (m + n + It ) — t y ' TV (m + n + It) — t 

Taking a limit as TV — > oo yields h top (X, a) > m+ ^ +2f log 2 > 0, which 
contradicts the zero entropy assumption. □ 

From now on we assume that ht op (X,a) = 0, and aim to show that X 
comprises a single periodic orbit. First we show that every word in Q is a 
prefix of a single infinite sequence. 

Lemma 6.2. There exists iel such that every word w G Q is of the form 
w = x% ■ ■ ■ x n for some n. 

Proof. It suffices to show that for any v, w G Q with m = \v\ < \w\, we have 
w\"' w m = v. This follows from Lemma [6.1( since there are words y,z G Ct 
such that vyw = wzv, and comparing the first m symbols of this common 
word gives the result. □ 
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Lemma 6.3. Given v,w 6 Q, write m = \v\, n = \w\, and suppose that n > 
m. Then w is (m + t) -periodic; that is, Wi = Wj whenever j = i mod (m + t). 

Proof. We show that uu is (fc+i)-periodic, where k+t = gcd(m+i, n+t). Let 
u l € Ck and x % 6 Ct be such that x = u l x l u 2 x 2 ■ ■ ■ , where x is the sequence 
from Lemma IB31 Write a = ^| and b = then u = ■u 1 x 1 u 2 • • • x a ~ l u a 
and it) = u l x l u 2 ■ ■ ■ x h ~ l u h . 

By Lemma [6. 11 there are words y,z £ Ct such that vyw = wzv. Therefore, 
the following two expressions represent the same word: 

u l x l ■■■x a - 1 u a y u l x 1 ■ ■ ■ x b - a - 1 u b - a x b - a u b - a+1 ■■■u b , 

uV • • • x a - l u a x a u a+l x a+l ■ ■ ■ x b - 1 u b z u l ■■■ u a . 

Comparing the subwords u 1 and x l in these two expressions, we see that 
u l = u 3 and x % = x 3 whenever i = j mod a. This follows from a comparison 
of the two middle segments, where the words w overlap. 

Similarly, a comparison of the final segments, where v appears as a suffix 
of w, shows that u b ~ l = u a ~ l for all < i < a and x b ~ l = x a ~ l for all 
1 < i < a. Since gcd(a, b) = 1, this is enough to show that u 1 = u 2 = ■ ■ -u b 
and x 1 = ■ ■ ■ = x b ~ l . □ 

We use Lemma 16.31 to show that the sequence x € X constructed in 
Lemma 16.21 is periodic. Indeed, if we fix v G Q and let m = \v\, then x is 
(m + £)-periodic. To see this, observe that Lemma 16.31 establishes (m + t)- 
periodicity for every w £ Q with > m. By Condition . Q contains 
arbitrarily long words; consequently, there are arbitrarily large values of n 
such that (m + t)-periodic, and this completes the proof. 
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